Thesaurus-Based Search in Large Heterogeneous Collections

نویسندگان

  • Jan Wielemaker
  • Michiel Hildebrand
  • Jacco van Ossenbruggen
  • Guus Schreiber
چکیده

In cultural heritage, large virtual collections are coming into existence. Such collections contain heterogeneous sets of metadata and vocabulary concepts, originating from multiple sources. In the context of the E-Culture demonstrator we have shown earlier that such virtual collections can be effectively explored with keyword search and semantic clustering. In this paper we describe the design rationale of ClioPatria, an open-source system which provides APIs for scalable semantic graph search. The use of ClioPatria’s search strategies is illustrated with a realistic use case: searching for ”Picasso”. We discuss details of scalable graph search, the required OWL reasoning functionalities and show why SPARQL queries are insufficient for solving the search problem.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A faceted search system for facilitating discovery-driven scientific activities: a use case from functional ecology

To address biodiversity issues in ecology and assess the consequences of ecosystem changes, large quantities of long-term observational data from multiple data sets need to be integrated and characterized in a unified way. During these last decades, functional trait-based approaches have shown great potential to facilitate the understanding and the prediction of ecosystem changes. To promote da...

متن کامل

Supporting Semantic Image Annotation and Search

In this article we discuss an application scenario for semantic annotation and search in a collection of art images. This application shows that background knowledge in the form of ontologies can be used to support indexing and search in image collections. The underlying ontologies are represented in RDF Schema and are based on existing data standards and knowledge corpora, such as the VRA Core...

متن کامل

امکان‌سنجی طرح تدوین اصطلاح نامۀ مطالعات زنان و خانواده براساس استاندارد BS ISO 25964-1

Research Objective: Feasibility study of the Family and Women’s Studies Thesaurus considering the expansion of information in the field of women and family studies, as well as the wide span of related vocabulary and the development of vocabulary lists and bibliographies, the Family and Women’s Studies Thesaurus can be a professional tool for indexing and retrieval of women’s information in data...

متن کامل

Improving Content Based Image Retrieval Systems with a Thesaurus for Shapes

Successful retrieval of relevant images from large-scale image collections is one of the current problems in the field of data management. In this paper, I propose a system which combines traditional CBIR techniques, such as feature extraction, with a thesaurus for objects / shapes. A brief presentation of the problem area is given, along with the basics of a prototype system, the VORTEX engine.

متن کامل

Faceted Access to Heterogeneous Cultural Heritage Collections using Semantic Web Techniques

Integrated digital access to multiple collections is a prominent issue for many Cultural Heritage institutions. The metadata describing diverse collections must be interoperable, which requires aligning the controlled vocabularies that are used to annotate objects in these collections. We demonstrate an interface prototype presenting two collections whose vocabularies have been matched applying...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008